support input_pos > 0 for prefill model #8127

billmguo · 2025-02-01T00:22:33Z

Summary: Test input_pos > 0 for prefill. Not intended for landing; this is for syncing with qc.

Differential Revision: D68847677


pytorch-bot · 2025-02-01T00:22:37Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/8127

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 2 New Failures, 2 Unrelated Failures

As of commit 38d22fb with merge base 92e7dbd:

NEW FAILURES - The following jobs have failed:

Check Labels / Check labels (gh)
RuntimeError: Error checking labels: PR does not have required labels
Lint / lintrunner / linux-job (gh)
>>> Lint for examples/qualcomm/oss_scripts/llama/model/static_llama.py:

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

pull / test-eval_llama-mmlu-linux / linux-job (gh) (trunk failure)
##[error]The operation was canceled.
pull / test-llava-runner-linux / linux-job (gh) (trunk failure)
##[error]The operation was canceled.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

facebook-github-bot · 2025-02-01T00:22:40Z

This pull request was exported from Phabricator. Differential Revision: D68847677

github-actions · 2025-02-01T00:23:14Z

This PR needs a `release notes:` label

If your changes are user facing and intended to be a part of release notes, please use a label starting with release notes:.

If not, please add the topic: not user facing label.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "topic: not user facing"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

billmguo · 2025-02-02T17:38:01Z

Let me explain a little. Tokens, freq_cos, freq_sin, mask, and the k/v caches will be passed to both the prefill and decode models.
Here we remove input_pos: prefill model inference works like an AR-pre_fill_len model, so input_pos does not come into play, and the runtime can fill in the freq_cos/freq_sin/kv/mask information directly. Let me know your thoughts. If you have better ideas for supporting multi-turn, that would be great!
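
As a rough illustration of what "the runtime fills in freq_cos/freq_sin/mask directly" could look like for a prefill chunk that starts at a position greater than 0, here is a minimal sketch in plain Python. The function names (`rope_tables`, `prefill_mask`), the RoPE base of 10000, the mask layout (cache slots first, then the current chunk), and the masked value are all assumptions made for illustration; none of this is taken from the code in this PR.

```python
import math


def rope_tables(start_pos, seq_len, head_dim, theta=10000.0):
    """Hypothetical helper: rotary cos/sin rows for positions
    start_pos .. start_pos + seq_len - 1 (one row per token)."""
    half = head_dim // 2
    # Standard RoPE inverse frequencies; theta=10000 is an assumed default.
    inv_freq = [theta ** (-2.0 * i / head_dim) for i in range(half)]
    freq_cos, freq_sin = [], []
    for pos in range(start_pos, start_pos + seq_len):
        freq_cos.append([math.cos(pos * f) for f in inv_freq])
        freq_sin.append([math.sin(pos * f) for f in inv_freq])
    return freq_cos, freq_sin


def prefill_mask(start_pos, seq_len, cache_len, masked_value=float("-inf")):
    """Hypothetical helper: additive attention mask of shape
    [seq_len][cache_len + seq_len].

    Assumed layout: the first cache_len columns are KV-cache slots, of
    which only the first start_pos hold valid entries; the last seq_len
    columns are the tokens of the current chunk, masked causally.
    """
    mask = []
    for row in range(seq_len):
        # Cache slots: visible only if already populated.
        cache_part = [0.0 if col < start_pos else masked_value
                      for col in range(cache_len)]
        # Current chunk: each token sees itself and earlier tokens.
        chunk_part = [0.0 if col <= row else masked_value
                      for col in range(seq_len)]
        mask.append(cache_part + chunk_part)
    return mask


# A second-turn prefill chunk of 2 tokens, with 3 tokens already cached.
freq_cos, freq_sin = rope_tables(start_pos=3, seq_len=2, head_dim=4)
mask = prefill_mask(start_pos=3, seq_len=2, cache_len=4)
```

With this shape of interface, the model graph itself never needs an input_pos argument: the starting position only affects which rows of the rotary tables and which mask columns the caller prepares.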

support input_pos > 0 for prefill model

38d22fb


facebook-github-bot added the CLA Signed label Feb 1, 2025

facebook-github-bot added the fb-exported label Feb 1, 2025

This was referenced Feb 11, 2025

Weekly pr metrics report - 2025-02-01..2025-02-07 wdvr/pytorch#6

Open

Weekly pr metrics report - 2025-02-01..2025-02-07 wdvr/pytorch#8

Open

This was referenced Feb 24, 2025

Weekly pr metrics report - 2025-02-01..2025-02-07 wdvr/pytorch#10

Open

Weekly pr metrics report - 2025-02-01..2025-02-07 wdvr/pytorch#14

Open
